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Introduction 


A new  learning  decision  rule,  the  "Distribution  Free,  Minimum  Conditional 
Risk  Learning  System"  (DFMCR),  is  introduced  to  the  sonar  problem  in  refer- 
ence [l].  There,  its  performance  in  processing  four  classes  of  Tressi  feature 
vectors,  is  experimentally  determined  for  one  type  of  tolerance  region 
construction.  The  tolerance  region  concept  is  a basic  distribution  free  con- 
cept used  in  the  definition  of  the  DFMCR  decision  rule.  The  type  of  tolerance 
region  construction  used  in  reference  Tl]  is  a sequential  construction  called 
"bounding,  binning,  and  sub-binning."  This  tolerance  region  construction 
technique  has  the  advantage  that  it  involves  straight  lines  Vhlch  are  easy 
to  Implement  in  a generalised  digital  computer  or  using  a small,  special 
purpose  computer. 

Performance  calculations  for  the  DFMCR  decision  rule  are  given  in  refer- 
ence [2].  These  performance  calculations  assume  that  tolerance  regions  can  be 
constructed  from  supervised  samples  such  that  a tolerance  region  is  suffi- 
ciently small  such  that  the  underlying  probability  density  function  can  be 
resumed  uniform  within  the  region.  It  is  not  necessary  that  the  probability 
density  function  be  uniform  within  the  region — only  that  the  uniformity 
assumption  does  not  significantly  Increase  the  probability  of  error. 

The  tolerance  regions  constructed  using  straight  lines  in  reference  Cl] 
become  "small"  as  the  number  of  training  samples  becomes  large;  however,  for 
"small"  sample  sizes  ( such  as  200,  four-dimensional  tressi  vectors),  these 
tolerance  regions  may  not  be  as  small  as  possible  or  desired.  Therefore  it 
is  desirable  to  find  tolerance  regions  which  are  "relatively  small"  for 
small  sample  sizes. 

In  reference  [2]  an  example  Is  given  using  circular  tolerance  rfgi.jaf^c 
These  tolerance  regions  are  relatively  small  for  small  sample  sizes.  It  Is 
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shown  in  reference  [2]  that  a DFMCR  decision  rule  using  circular  tolerance 
regions  includes  such  decision  rules  as  the  "nearest  neighbor"  decision  rule 
as  special  case — thus  demonstrating  another  powerful  characteristic  of  the 
DFMCR  decision  rule.  A disadvantage  of  the  circular  tolerance  region,  however, 
is  that  it  is  more  complex  to  implement  than  the  tolerance  region  using 
straight  lines.  The  circular  tolerance  region  offers  no  complexity  reduction 
and  can't  be  used  for  large  sample  sizes.  For  large  sample  sizes,  a complexity 
reducing  tolerance  region  such  as  constructed  using  straight  lines  is  neces- 
sary. 

If  the  classes  concerned  are  separable,  then  it  is  expected  that  almost 
any  method  of  constructing  tolerance  regions  would  cause  the  DFMCR  decision 
rule  to  perform  with  zero  probability  of  error.  Because  the  sonar  classes 
represented  by  the  Tressi  feature  vectors  are  not  separable,  a circular  toler- 
ance region  should  produce  a lower  probability  of  error  than  a tolerance 
region  constructed  using  straight  lines.  This  report  includes  experimental 
results  for  the  performance  of  the  DFMCR  decision  rule,  using  circular  toler- 
ance regions  when  processing  Tressi  feature  vectors. 
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1.1, 


Circular  Tolerance  Regions  and  DFMCR  Deciaion  Rule 

The  DFMCR  decision  rule  using  circular  tolerance  regions  is  described 

as  follows:  Suppose  there  sure  four  sets  of  training  vectors  Indexed  by 

u^,  tt >2,  u)y  and  oj^  respectively.  Let  n..  training  vectors  be  available  for 

each  class  i«l,  2,  3,  such  that  n-n^+n^+n^+n^.  Then  a n+18^  vector 

sample  (called  the  candidate  sample)  is  presented  for  recognition.  The 

decision  rule  provides  for  storing  the  vector  samples  for  each  class  as 

shown  for  classes  «o^  and  o>g  in  Figure  1,  for  a two-dimensional  example. 

£ 

Suppose  that  the  candidate  sample,  denoted  by  X , is  as  shown  in  Figure  1. 

£ 

Then,  for  class  construct  a circ]e  cente-ed  at  X and  passing  through 
the  nearest  sairple  to  XC.  The  eyanple  i i Figure  1 is  for  v*3.  Then  the 
resulting  circles  have  all  the  properties  of  tolerance  regions  containing 
3 samples.  In  reference  [2]  it  is  shown  that  to  make  a decision  as  to  which 
class  caused  Xc,  decide  that  class  viose  tolerance  region  has  the  smallest 
volume  (minimizes  probability  of  error  assuming  equal  class  probabilities 
and  zero-one  loss  functions). 

Circular  tolerance  regions,  constructed  as  above,  were  used  in  the  DFMCR 
to  process  the  same  Tressi  vectors  previously  processed  using  tolerance  regions 
constructed  using  straight  lines.  The  experimental  performance  is  shown  in 
Table  1.  For  all  cases  considered  n ’nhle  1,  150  training  samples  were 
used  for  each  class.  For  recognition  r>0  samples  were  used  which  were  not 
part  of  the  training  set. 

The  results  shown  in  Table  1 should  be  compared  with  those  in  Table  k of 
reference  Cl].  By  doing  this  it  is  seen  that  circular  tolerance  regions 
result  is  considerable  itaprOtement  in  performance  over  tolerance  regions 


constructed  using  straight  lines. 
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The  experimental  procedure  used  in  obtaining  the  results  in  Table  1 are 
as  follows.  If  XC  is  from  a submarine  class  (w^,  or  an  error  is 

recorded  only  if  Xc  is  decided  from  class  ui^  (non  sub). 

Conclusions 

Circular  tolerance  regions  give  greater  performance  for  small  sample 
sizes  than  tolerance  regions  constructed  using  straight  lines.  For  situations 
where  the  vector  samples  are  not  separable  and/or  the  number  of  sample  vectors 
available  for  training  is  "relatively  small,"  circular  tolerance  regions 
should  be  used  Instead  of  tolerance  regions  constructed  using  straight  lines. 
The  circular  tolerance  regions  used  in  the  experiments  reported  were  con- 
structed with  v»3«  It  was  found  experimentally  that  v < 3 or  v > 3 resulted 
in  poorer  experimental  performance.  Theoretical  results,  with  v as  a 
parameter,  would  be  desirable. 

The  Tressl  feature  vectors  were  used  in  the  experiments  reported  herein 
for  the  purpose  of  comparison. 
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